[linux-nvidia-6.17] Use architecture specific HBM training status register by ankita-nv · Pull Request #331 · NVIDIA/NV-Kernels

ankita-nv · 2026-02-25T13:51:13Z

Blackwell-Next GPUs use a different BAR0 offset (0xAD00BC) for the HBM
training status register than GB200 (0x200BC). Add runtime detection by
reading the architecture field from PMC BOOT_42 and selecting the
appropriate offset when polling for device readiness.

Signed-off-by: Ankit Agrawal ankita@nvidia.com

nvmochs

This looks good to me.

Acked-by: Matthew R. Ochs <mochs@nvidia.com>

nvmochs · 2026-02-25T16:56:07Z

@ankita-nv Are there plans to upstream this patch?

nirmoy · 2026-02-25T17:01:59Z

Acked-by: Nirmoy Das<nirmoyd@nvidia.com>

ankita-nv · 2026-02-25T18:17:07Z

@ankita-nv Are there plans to upstream this patch?

Yeah, I'll post it shortly after internal review.

nvmochs · 2026-02-26T14:38:11Z

Ankit requested that we hold on getting this integrated.

…diness check Blackwell-Next GPUs report device readiness via the CXL DVSEC Range 1 Low register (offset 0x1C) instead of the BAR0 HBM training register used by GB200. The GPU memory readiness is checked by polling for the Memory_Active bit (bit 1) for the Memory_Active_Timeout (bits 15:13). Add runtime detection by checking the presence of the DVSEC register. Route to the new method if present, otherwise continue using the legacy approach. Signed-off-by: Ankit Agrawal <ankita@nvidia.com>

nvmochs changed the title ~~Use architecture specific HBM training status register~~ [linux-nvidia-6.17] Use architecture specific HBM training status register Feb 25, 2026

nvmochs self-requested a review February 25, 2026 16:54

nvmochs approved these changes Feb 25, 2026

View reviewed changes

ankita-nv force-pushed the 24.04_linux-nvidia-6.17-next-probe-fix-2502 branch 2 times, most recently from 58fc644 to 4b04466 Compare March 14, 2026 06:15

ankita-nv force-pushed the 24.04_linux-nvidia-6.17-next-probe-fix-2502 branch from 4b04466 to f400624 Compare March 14, 2026 06:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[linux-nvidia-6.17] Use architecture specific HBM training status register#331

[linux-nvidia-6.17] Use architecture specific HBM training status register#331
ankita-nv wants to merge 1 commit intoNVIDIA:24.04_linux-nvidia-6.17-nextfrom
ankita-nv:24.04_linux-nvidia-6.17-next-probe-fix-2502

ankita-nv commented Feb 25, 2026

Uh oh!

nvmochs left a comment

Uh oh!

nvmochs commented Feb 25, 2026

Uh oh!

nirmoy commented Feb 25, 2026

Uh oh!

ankita-nv commented Feb 25, 2026

Uh oh!

nvmochs commented Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ankita-nv commented Feb 25, 2026

Uh oh!

nvmochs left a comment

Choose a reason for hiding this comment

Uh oh!

nvmochs commented Feb 25, 2026

Uh oh!

nirmoy commented Feb 25, 2026

Uh oh!

ankita-nv commented Feb 25, 2026

Uh oh!

nvmochs commented Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants